Bayesian Causal Inference with Bipartite Record Linkage
نویسندگان
چکیده
In some scenarios, the observational data needed for causal inferences are spread over two files. particular, we consider scenarios where one file includes covariates and treatment measured on a set of individuals, second responses another, partially overlapping individuals. absence error-free direct identifiers like social security numbers, straightforward merging separate files is not feasible, so that records must be linked using error-prone variables such as names, birth dates, demographic characteristics. Typical practice in situations generally follows two-stage procedure: first link probabilistic linkage technique, then make with dataset. This does propagate uncertainty due to imperfect linkages inference, nor it leverage relationships among study improve quality linkages. We propose joint model simultaneous Bayesian inference effects addresses these deficiencies. Using simulation studies theoretical arguments, show can accuracy estimated effects, well record linkages, compared modeling option. illustrate constructed debit card possession household spending.
منابع مشابه
Bayesian Estimation of Bipartite Matchings for Record Linkage
The bipartite record linkage task consists of merging two disparate datafiles containing information on two overlapping sets of entities. This is non-trivial in the absence of unique identifiers and it is important for a wide variety of applications given that it needs to be solved whenever we have to combine information from different sources. Most statistical techniques currently used for rec...
متن کاملBayesian Parametric and Nonparametric Inference for Multiple Record Linkage
Record linkage is an historically important statistical problem arising when data about some population of individuals is spread over several files. As kids, we grew up with the game “Where in the world is Carmen San Diego”? Nowadays, the name of the game for the U.S. Census Bureau and other organizations is who’s the real Steve Fienberg, where they are dealing with deciding if someone named St...
متن کاملHierarchical Bayesian Record Linkage Theory
In record linkage, or exact file matching, one compares two or more files on a single population for purposes of unduplication or production of an enhanced, merged database. Record linkage has many applications, including in population enumeration efforts, to create databases for epidemiological investigations, and to improve survey sample frames. Latent class and mixture models have been used ...
متن کاملBayesian Matching for Causal Inference
In this paper we provide Bayesian matching methods for finding the causal effect of a binary intake variable x ∈ {0, 1} on an outcome of interest y. One technique we introduce is a Bayesian variant of the classic Rosenbaum and Rubin (1983, 1984) propensity score matching method. We show how it is possible to find the posterior distribution of the Bayesian matched sample average treatment effect...
متن کاملAn Experiment in naïve Bayesian Record Linkage
Sharing data can represent a risk of disclosing sensitive information about the individuals which the data sets concern. Computationally complex techniques can be used by a socalled ‘data intruder’ to link such data and discover information about targeted individuals. Heuristic approaches to limiting this risk are aimed towards the more casual intruder. A knowledgeable intruder, armed with data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bayesian Analysis
سال: 2022
ISSN: ['1936-0975', '1931-6690']
DOI: https://doi.org/10.1214/21-ba1297